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Mosquito  repellency  data  on  acylpiperidines  derived  from  the  U.S. 
Department  of  Agriculture  archives  were  modeled  by  using  mo¬ 
lecular  descriptors  calculated  by  CODESSA  PRO  software.  An  arti¬ 
ficial  neural  network  model  was  developed  for  the  correlation  of 
these  archival  results  and  used  to  predict  the  repellent  activity  of 
novel  compounds  of  similar  structures.  A  series  of  34  promising 
A/-acylpiperidine  mosquito  repellent  candidates  (4a-4q')  were  syn¬ 
thesized  by  reactions  of  acylbenzotriazoles  2a-2p  with  piperidines 
3a-3f.  Compounds  (4a-4q')  were  screened  as  topically  applied 
mosquito  repellents  by  measuring  the  duration  of  repellency  after 
application  to  cloth  patches  worn  on  the  arms  of  human  volun¬ 
teers.  Some  compounds  that  were  evaluated  repelled  mosquitoes 
as  much  as  three  times  longer  than  /V,A/-diethyl-m-toluamide 
(DEET),  the  most  widely  used  repellent  throughout  the  world.  The 
newly  measured  durations  of  repellency  were  used  to  obtain  a 
superior  correlation  equation  relating  mosquito  repellency  to  mo¬ 
lecular  structure. 

/V-acylpiperidine  |  quantitative  structure-activity  relationship  ] 

CODESSA  PRO  |  artificial  neural  network  |  Aedes  aegypti 

Mosquito-borne  diseases  such  as  malaria,  arboviral  enceph¬ 
alitis,  dengue  fever,  Rift  Valley  fever,  and  yellow  fever 
produce  significant  morbidity  and  mortality  in  humans  and 
livestock  in  many  parts  of  the  world  (1).  Pathogens  that  cause 
these  diseases  are  transmitted  by  injection  of  saliva  into  suscep¬ 
tible  hosts  by  female  mosquitoes  needing  protein  from  a  blood 
meal  to  develop  their  eggs.  Repellents  play  a  vital  role  in 
interrupting  this  mosquito/human  interaction  by  serving  as  a 
means  of  personal  protection  by  reducing  bites  from  mosquitoes. 
Although  vaccines  and  genetically  modified  mosquitoes  are 
under  development  for  prevention  of  mosquito-borne  diseases, 
new  and  improved  topical  repellents  are  needed  to  provide 
alternatives  that  are  safer,  effective  for  a  longer  duration,  and  more 
efficacious  against  mosquitoes  and  a  wide  range  of  arthropods. 

Some  repellents  that  are  applied  to  the  skin  or  clothing  are 
highly  effective  but  only  of  limited  duration  because  of  evapo¬ 
rative  loss,  dermal  absorption,  abrasive  loss,  or  dissolution  in 
water  and  perspiration  (2,  3).  Furthermore,  some  compounds 
cause  skin  irritation  or  a  stinging  sensation  when  they  contact 
eyelids  or  lips.  Finally,  several  are  successful  at  preventing  bites 
only  when  they  are  present  on  the  skin  or  clothing  in  relatively 
large  quantities.  Thus,  the  current  research  is  driven  by  the  need 
to  discover  a  new  generation  of  repellents  that  overcome  these 
limitations  and  supplement  or  replace  today’s  standard  repellent 
( /V, A'- die  t  hyl-m  -  to  lu  amide ,  DEET). 

Insects  are  believed  to  detect  repellents  through  receptor 
uptake  of  molecules  with  specific  chemical  characteristics  (4-7). 
Therefore,  we  have  conducted  extensive  studies  of  the  relation¬ 
ship  between  molecular  structure  and  the  observed  biological 
property  of  repellency.  Our  approach  is  based  on  a  rational 
design  similar  to  that  used  by  the  pharmaceutical  industry  for 
drug  development  but,  in  this  case,  is  specifically  aimed  at 
products  that  interfere  with  mosquito  olfaction  and/or  deter 


biting  of  a  host.  The  use  of  quantitative  structure -activity 
relationship  (QSAR)  approaches  to  repellent  discovery  is  rela¬ 
tively  uncommon  (8).  In  one  of  the  most  recent  studies,  three- 
dimensional  (3D)-QSAR  was  used  successfully  with  CATALYST 
software  to  develop  models  for  repellents  based  on  pharma¬ 
cophores  (8).  We  have  chosen  to  use  artificial  neural  network 
(ANN)  modeling  because  it  is  one  of  the  most  efficient  QSAR 
approaches.  Neural  networks  have  been  applied  in  many  diverse 
scientific  endeavors,  including  economics,  engineering,  physics, 
chemistry,  and  medical  science  (9).  A  particular  advantage  of 
ANNs  is  their  inherent  ability  to  incorporate  nonlinear  depen¬ 
dencies  between  the  dependent  and  independent  variables  with¬ 
out  using  an  explicit  mathematical  function. 

Methodology  for  a  general  QSAR  approach  has  been  devel¬ 
oped  and  coded  as  the  CODESSA  PRO  software  package  (10). 
We  previously  examined  31  repellents  by  using  CODESSA  PRO 
(11)  and  here  extend  this  research  by  examining  available  data 
on  /V-acylpiperidines,  calculating  their  structure-activity  rela¬ 
tionships,  synthesizing  novel  compounds,  and  performing  repel¬ 
lency  assays  with  human  volunteers  (12). 

Results  and  Discussion 

Correlation  of  Existing  Mosquito  Repellency  Data  for  Al-acylpiperidines 
with  Chemical  Structures.  Dataset.  Early  investigations  on  the 
synthesis  of  insect  repellents  and  their  practical  tests  on  skin  and 
cloth  against  mosquitoes  and  other  biting  Diptera  have  been  well 
documented  (13-24).  In  the  present  work,  nonlinear  QSAR 
modeling  based  on  the  ANN  approach  was  performed  using 
available  data  for  200  /V-acylpiperidines.  The  efficacy  of  these 
compounds  was  determined  by  a  “time  point  of  failure”  defined 
as  a  specified  number  of  bites  (usually  a  low  number)  by  female 
Aedes  aegypti  (L.)  mosquitoes  through  cloth  treated  with  each 
candidate  piperidine  and  fixed  on  the  arm  of  a  volunteer  (12). 
The  original  data  on  piperidines  were  obtained  from  U.S. 
Department  of  Agriculture  (USDA)  records  collected  and  com¬ 
piled  over  the  past  50  years  (25-27).  Historically,  the  level  of 
repellency  was  divided  into  five  classes,  as  defined  by  “the  time 
to  first  bite.”  This  classification  system  placed  all  repellents  that 
were  efficacious  for  >21  d  into  class  5,  the  top  tier.  The 
remaining  classes  were  divided  as  class  4  for  10-  to  21-d 
protection,  class  3  for  5-  to  10-d  protection,  class  2  for  1-  to  5-d 
protection,  and  class  1  as  ineffective  (<l-d  protection). 
Nonlinear  QSAR  modeling.  The  objective  of  these  calculations  was  to 
build  a  predictive  ANN  model  able  to  classify  the  duration  of 
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repellent  efficacy  and  use  this  information  to  predict  structures 
of  novel  repellents.  The  data  set  was  divided  into  training  (150 
repellents)  and  validation  (50  repellents)  subsets.  The  main 
model  is  based  on  the  back  propagation  learning  algorithm  for 
optimization  of  the  ANN  (28,  29).  To  obtain  good  generalization 
of  the  ANN,  the  following  factors  were  taken  into  account: 

( i )  To  build  a  general  model,  150  of  the  200  compounds  [see 
supporting  information  (SI)  Table  SI  for  details]  were  randomly 
selected  to  form  the  training  set. 

(«)  The  preferred  architecture  of  the  ANN  model  (by  rms 
error)  was  8-7-1  (i.e.,  8  neurons  in  the  input  layer,  7  neurons  in 
the  hidden  layer,  and  1  neuron  in  the  output)  representing  the 
time  class  of  the  repellent. 

(Hi)  A  sensitivity  analysis  was  done  by  building  1-1-1  ANN 
models,  and  the  descriptors  that  showed  lowest  error  at  the 
output  were  selected.  In  addition,  scatter  plot  analysis  between 
the  descriptors  and  the  time  classes  was  carried  out  to  reveal  the 
variability  of  the  classes  with  respect  to  the  descriptors.  Finally, 
a  combination  of  eight  descriptors  was  selected  as  inputs  for  the 
ANN:  total  number  of  bonds,  molecular  weight,  Kier  and  Hall 
index  (order  3),  molecular  surface  area,  total  dipole  of  the 
molecule,  total  molecular  electrostatic  interaction,  surface  area 
for  atom  C,  and  surface  area  for  atom  N. 

The  training  procedure  of  the  ANN  was  stopped  when  the  rms 
error  of  the  validation  set  started  to  increase.  The  final  results  for 
the  coefficient  of  determination  and  rms  error  for  the  training 
set  were  0.73  and  0.87,  respectively.  The  calculated  rms  error  of 
the  validation  set  was  1.4.  As  expected,  the  rms  error  of  the 
validation  set  was  bigger  than  that  of  the  training  subset. 

Utilization  of  the  ANN  Model  to  Design  and  Select  New  Compounds  for 
Synthesis.  Discussion  of  the  ANN  model.  Based  on  the  results  ob¬ 
tained  by  the  ANN  model,  the  repellent  time  classes  for  addi¬ 
tional  compounds  were  predicted.  The  expected  accuracy  of 
prediction  of  the  repellency  class  for  a  new  compound  by  using 
the  results  on  measured  compounds  is  given  by  NJN,  where  Ns 
is  the  number  of  the  repellents  predicted  by  the  network  at  this 
particular  class  and  N  is  the  total  number  of  the  repellents  in  a 
given  experimental  class.  Thus,  the  accuracies  of  each  class  are 
as  follows  (see  Fig.  SI  for  more  information):  class  1,  0.76;  class 
2,  0.69;  class  3,  0.50;  class  4,  0.25;  and  class  5,  0.35.  If  classes  4 
and  5  are  combined  into  a  single  class,  the  exact  number  of 
compounds  predicted  from  class  4  and  class  5  combined  is  39. 
Therefore,  the  predicted  accuracy  for  class  4  and  class  5  com¬ 
pounds  is  71%  (39/55).  This  prediction  accuracy  is  quite  high, 
considering  the  distribution  of  the  class  values.  Therefore,  this 
ANN  model  can  be  used  with  some  confidence  to  predict  novel 
candidate  repellents. 

Selecting  compounds  for  synthesis.  Good  correlation  existed  be¬ 
tween  the  experimental  and  predicted  values  of  the  repellent 
efficacy.  Of  the  34  compounds  selected  for  synthesis,  11  of  these 
(see  Table  1  and  Table  S2  for  more  details  and  refs.  20  and 
30-33)  were  chosen  from  those  examined  previously.  The  re¬ 
pellency  of  these  11  compounds  was  reassessed  to  (;)  compare 
results  obtained  from  the  biological  test  procedure  to  experi¬ 
mental  results  of  those  conducted  and  reported  previously  (26); 
(«)  conduct  a  rigorous  assessment  of  the  testing  protocol  using 
new  analogs  and  thereby  to  gain  confidence  in  the  utility  of  the 
output  data;  and  (Hi)  increase  predictive  ability  and  accuracy  by 
reexamination  of  the  most  effective  repellent  candidates  iden¬ 
tified  as  class  4  and  class  5  from  previous  assays  and  then  use  the 
protocols  to  further  divide  the  repellency  of  these  compounds 
into  additional  subclasses. 

At  present,  the  activities  of  «=2,000  untested  analogs  have  been 
predicted.  A  total  of  23  were  selected  from  this  set  for  synthesis. 
Using  the  ANN  model,  we  predicted  these  compounds  to  be 
principally  members  of  repellent  classes  3,  4,  and  5  (see  Table  1 
and  Table  S3  for  details,  and  refs.  15,  20,  and  33-38).  Thus, 


together  with  the  11  compounds  previously  examined  (see  Table 
1)  with  activity  predicted  by  the  ANN  as  classes  4  and  5,  we 
synthesized  a  total  of  34  compounds  of  general  structure  4  as 
shown  in  Fig.  1. 

Synthesis  of  Additional  N-acylpiperidines.  Introduction.  /V-acylpiperi- 
dines  have  generally  been  synthesized  by  reaction  of  acid  chlo¬ 
rides  with  amines  (either  the  amine  itself  or  pyridine  was  used 
as  a  hydrochloric  acid  scavenger).  The  product  amides  are  then 
isolated  by  routine  extraction  procedures  and  purified  by  crys¬ 
tallization,  chromatography,  or  distillation  under  high  vacuum. 

The  high  reactivity  of  acid  halides  and  their  incompatibility 
with  acid-sensitive  functionalities  prompted  efforts  to  find  al¬ 
ternative  methods.  A  benzotriazolyl  group  is  easily  introduced, 
activates  molecules  toward  numerous  transformations,  and  can 
be  removed  easily  at  the  end  of  each  reaction  sequence  (39-44). 
1-Acylbenzotriazoles  are  advantageous  carboxylic  acid  deriva¬ 
tives  because  they  are  stable  and  readily  available  in  one  step 
from  carboxylic  acids  even  where  an  acid-sensitive  functionality 
is  present  (45-47).  We  report  herein  the  use  of  this  method  in 
the  synthesis  of  A-acylpiperidines  as  candidates  for  mosquito 
repellents. 

Synthetic  results  and  discussion.  1-Acylbenzotriazoles  2  (Fig.  l)were 
produced  by  treatment  of  the  corresponding  carboxylic  acids  1 
at  25°C  with  thionyl  chloride  and  benzotriazole  in  methylene 
chloride  in  1:1:3  mole  ratio  (48). 

Reaction  of  1-acylbenzotriazoles  2  with  1  eq  of  piperidines  3 
in  THF  at  20°C  or  in  toluene  under  reflux  resulted  in  formation 
of  A-acylpiperidines  4  by  our  modified  procedure  (46).  Subse¬ 
quent  evaporation  of  the  solvent,  addition  of  methylene  chloride, 
and  washing  with  aqueous  sodium  carbonate  solution  gave 
/V-acylpiperidines  4a-4q'  in  71-100%  yield  (Tables  S2  and  S3). 

Experimental  (if  available)  and  predicted  classes  of  repellency 
according  to  the  ANN  model  are  given  in  Table  SI.  From  these 
data,  the  11  candidates  reselected  for  synthesis  are  listed  in  Table 
1  and  Table  S2.  These  compounds  have  the  highest  measured 
activity  (i.e.,  class  5)  and  also  possess  high  activities  predicted  by 
the  ANN  model. 

Another  23  compounds  were  synthesized  as  repellent  candi¬ 
dates  (Table  1  and  Table  S3).  Of  these,  17  (74%)  were  predicted 
by  ANN  as  highly  active  repellents  falling  in  classes  4  and  5, 
which  in  theory  provide  protection  from  mosquito  bites  for  >10 
d  when  applied  to  cloth  worn  over  the  skin.  For  controls,  we  also 
synthesized  six  compounds  with  relatively  poor  (classes  1  and  2) 
predicted  repellency  (4a-4f,  Table  1,  Table  S3).  Measurements 
of  the  biological  activity  of  all  these  compounds  are  described  in 
the  section  on  biological  testing. 

NMR  Spectra  of  /V-acylpiperidines.  The  proton  spectra  of  the 
unsymmetrical  amides  4a,  4c,  4h,  4i,  41,  4p,  4q,  4c',  4h'-k',  4n', 
and  4o'  all  revealed  hindered  rotation  about  the  N-CO  bond  on 
the  NMR  time  scale.  Detailed  variable  temperature  NMR 
spectra  of  4c,  4p',  and  4q'  gave  AG *  values  of  16.2  ±  0.3 
kcal-mol-1  for  interconversion  of  the  rotamers,  corresponding  to 
a  first-order  rate  constant  of  «=30  s-1  at  ambient  temperature. 
Thus,  hindered  rotation  is  not  expected  to  influence  the  biolog¬ 
ical  activity  (see  Si  Text,  Figs.  S2-S5,  and  Table  S4  for  more 
details). 

Biological  Testing.  Initially,  mass  concentrations  (12)  were  used  at 
the  start  of  experiments,  but  a  modification  of  the  initial  experi¬ 
mental  setting  was  necessary  so  that  the  current  modeling  scheme 
would  meet  one  of  the  following  QSAR  conditions  (i)  measure¬ 
ment  of  the  induced  biological  effect  at  a  constant  concentration  or 
(ii)  measurement  of  the  concentration  that  causes  a  constant 
biological  effect.  The  first  condition  is  closer  to  the  actual  appli¬ 
cation  rate  of  repellents  in  the  field  and  thus  was  chosen  for  further 
study.  The  experimental  determination  of  repellency  was  based  on 
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Table  1.  Predicted  ANN  class,  averaged  experimental  protection  time  (PT)  from  tests  1  and  2,  standard  deviation  of  the  data, 
and  converted  class 


Compound 

Predicted 

ANN  class 

*■ 

PT,  d, 

at  25  /xmol/cm2 

Conv. 

class 

PT, 

d,  at  2.5  /xmol/cm2 

Conv. 

class 

ID  no. 

R 

R' 

Test  1 

Test  2 

Average 

Test  1 

Test  2 

Average 

DEET 

14 

21 

17.5 

4 

2 

3 

2.5 

2 

4a 

Me 

2-Me 

4.82 

5 

1 

3 

2 

2 

1 

3 

2 

2 

4b 

Et 

H 

4.56 

5 

7 

3 

5 

2 

1 

7 

4 

2 

4c 

Et 

2-Et 

4.10 

4 

3 

7 

5 

2 

3 

3 

3 

2 

4d 

/■7-C6H13 

2-Me 

3.14 

3 

17 

17 

17 

4 

3 

7 

5 

2 

4e 

n-C  6H13 

3-Me 

3.11 

3 

14 

17 

15.5 

4 

7 

8 

7.5 

3 

4f 

n-C7H15 

4-Me 

2.69 

3 

43 

53 

48 

5 

9 

7 

8 

3 

4g 

n-C7H15 

4-Bn 

1.82 

2 

9 

17 

13 

4 

7 

7 

7 

3 

4h 

n-CsHi7 

2-Et 

3.01 

3 

23 

63 

43 

5 

10 

9 

9.5 

3 

4i 

D-CgHig 

2-Me 

2.85 

3 

21 

78 

49.5 

5 

9 

7 

8 

3 

4j 

D-CgHig 

4-Me 

2.78 

3 

23 

59 

41 

5 

9 

14 

11.5 

4 

4k+ 

CH2=CH(CH2)s 

H 

3.87 

4 

37 

63 

50 

5 

10 

17 

13.5 

4 

41 

CH2=CH(CH2)s 

2-Et 

1.56 

2 

21 

85 

53 

5 

9 

9 

9 

3 

4m 

CH2=CH(CH2)s 

4-Bn 

1.11 

1 

7 

10 

8.5 

3 

7 

9 

8 

3 

4n 

CH2=CH(CH2)s 

4-Me 

2.26 

2 

73 

73 

73 

5 

0 

21 

10.5 

4 

4o 

n-CioH2i 

H 

2.12 

2 

23 

56 

39.5 

5 

9 

17 

13 

4 

4p 

n-CiiH23 

2-Me 

3.63 

4 

7 

22 

14.5 

4 

3 

7 

5 

2 

4q 

n-CnH23 

3-Me 

1.64 

2 

10 

29 

19.5 

4 

3 

8 

5.5 

3 

4a ,f 

1-c-C6H9 

H 

4.87 

5 

17 

17 

17 

4 

3 

7 

5 

2 

4b'f 

C-C6H,i 

H 

5.21 

5 

14 

14 

14 

4 

7 

9 

8 

3 

4c't 

c-C6H„ 

3-Me 

5.01 

5 

17 

17 

17 

4 

3 

9 

6 

3 

4d't 

c-C6H„ 

4-Me 

5.02 

5 

28 

21 

24.5 

5 

7 

10 

8.5 

3 

4e'f 

c-C5H9(CH2)2 

H 

4.33 

4 

28 

42 

35 

5 

9 

9 

9 

3 

4f'+ 

1-Me-c-C6H10 

3-Me 

3.98 

4 

10 

14 

12 

4 

7 

7 

7 

3 

4g' 

4-Me-c-C6H10 

2-Me 

4.98 

5 

28 

38 

33 

5 

8 

9 

8.5 

3 

4h't 

c-C6H„ 

2-Et 

5.23 

5 

21 

22 

21.5 

5 

7 

7 

7 

3 

4i'f 

c-C6H„CH2 

2-Me 

4.55 

5 

24 

35 

29.5 

5 

7 

8 

7.5 

3 

4jft 

c-C6H„(CH2)2 

2-Me 

3.56 

4 

29 

66 

47.5 

5 

10 

10 

10 

3 

4k'f 

c-C6Hh(CH2)2 

3-Me 

3.62 

4 

14 

56 

35 

5 

7 

11 

9 

3 

41' 

c-C6H„(CH2)2 

4-Me 

4.03 

4 

28 

63 

45.5 

5 

7 

9 

8 

3 

4m' 

c-C6H11(CH2)3 

4-Me 

3.66 

4 

10 

56 

33 

5 

3 

3 

3 

2 

4n' 

c-CsH9(CH2)2 

2-Et 

4.74 

5 

23 

58 

40.5 

5 

7 

10 

8.5 

3 

4o' 

c-C6H„(CH2)2 

2-Et 

4.34 

4 

21 

63 

42 

5 

0 

21 

10.5 

4 

4p 

c-C6H„CH2 

4-Bn 

3.56 

4 

3 

3 

3 

2 

0 

3 

1.5 

2 

4q' 

c-C6H„(CH2)2 

4-Bn 

2.98 

3 

7 

17 

12 

4 

1 

1 

1 

1 

Bn,  benzyl. 

*ANN  predicted  class,  three  significant  figures  on  the  left,  rounded  on  the  right. 

Repellents  that  had  previously  been  evaluated  for  repellent  classes;  all  others  are  candidate  repellents  synthesized  for  this  study. 


two  tests  per  compound  performed  at  2.5  and  25  /xmol/cm2 
concentrations,  and  average  protection  times  (d)  were  derived  from 
the  two  tests  (see  Table  1). 


Ah 

R  =  alkyl 
or  aryl 

1 


BtH,  SOCI2 
CH2CI2 

Ht> 

3 


O 


R  Bt 
2 

THF,  25  °C  or 
PhMe,  reflux 


Fig.  1.  Synthetic  scheme  for  the  preparation  of  W-acylpiperidines  4.  Bt, 
benzotriazolyl. 


The  repellent  4-methyl-l-(l-oxooctyl)piperidine  (4f)  (at  25 
/xmol/cm2)  failed  on  day  43  for  the  female  volunteer,  whereas  the 
male  had  a  repellent,  2-ethyl-l-(l-oxo-10-undecylenyl)piperi- 
dine  (41)  that  provided  protection  for  85  d.  With  both  the  male 
and  female  volunteers  repellency  persisted  for  73  d  at  25 
/xmol/cm2  with  4-methyl-l-(l-oxo-10-undecylenyl)piperidine 
(4n).  Repellent  41  was  the  most  potent  toxicant  as  measured  by 
LD50  of  the  repellents  examined  by  Pridgeon  et  al.  (49).  For 
comparison,  and  highly  illustrative  of  the  efficacy  of  these 
candidate  repellents,  the  failure  point  (protection  time)  for  25 
/xmol/cm2  DEET  on  cloth  averaged  17.5  d  for  this  screening 
assay.  At  the  low  dose  of  2.5  /xmol/cm2,  repellent  4k,  l-(l-oxo- 
10-undecylenyl)piperidine,  averaged  13.5-d  protection  com¬ 
pared  with  only  2.5-d  protection  for  the  2.5  /xmol/cm2  DEET 
standard. 

Statistical  examination  of  the  average  protection  times  (PT) 
revealed  several  important  characteristics. 

( i )  The  distribution  of  the  data  at  2.5  /xmol/cm2  is  close  to 
normal  (Gaussian),  whereas  the  one  at  25  /xmol/cm2  deviates 
slightly  (see  Fig.  2). 

2)  Perhaps  because  most  of  the  compounds  measured  were 
highly  active,  the  experimental  PT  values  at  the  lower  concen- 
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Fig.  2.  Histograms  and  probability  density  functions  of  the  averaged  pro¬ 
tection  time  (PT)  at  25  fxmol/cm2  (a)  and  2.5  /unol/cm2  (b). 


tration  of  2.5  pmol/cm2  are  more  precise  than  those  at  25 
/xmol/cm2.  Hence,  the  QSAR  modeling  results  at  the  lower 
concentration  are  expected  to  be  more  reliable. 

Comparison  of  the  Present  Bioassay  Values  with  the  Archival  Data. 

The  dataset  of  34  N-acylpiperidines  with  averaged  PT  trans¬ 
formed  into  classes  was  used  to  estimate  the  predictive  power  of 
ANN.  No  significant  correlation  was  found  between  the  exper¬ 
imental  PT  values  and  those  predicted  by  ANN  at  either  25  or 
2.5  pimol/cm2  (columns  5  vs.  10  and  5  vs.  15  produced  R2  =  0.007 
andR2  =  0.06,  respectively).  The  possible  mismatch  between  the 
ANN-predicted  classes  (based  on  archival  data)  and  those 
measured  in  the  present  work  could  be  due  to  the  significant 
difference  in  the  experimental  settings:  mass  instead  of  molar 
concentrations  used,  fewer  mosquitoes  tested  (500  instead  of 
2,000-4,000),  different  cage  size,  and  use  of  a  new  PT  definition 
(see  Methods). 

We  conclude  that  the  use  of  classes  as  such  is  not  appropriate 
and  does  not  provide  sufficient  levels  to  clearly  discriminate  the 
repellency  effectiveness  of  these  compounds.  We  now  use  the 
average  days  of  protection  (duration)  as  a  more  precise  basis  for 
QSAR  modeling. 

New  QSAR  Modeling.  By  using  the  Best  Multilinear  Regression 
(BMLR)  algorithm  integrated  into  CODESSA  PRO 
[CODESSA  PRO  Software,  University  of  Florida  (2002),  ww- 
w.codessa-pro.com],  QSAR  models  with  up  to  seven  descriptors 
[the  maximum  allowed  by  “5-to-l”  rule  of  thumb  (50)]  were 
generated  for  both  concentrations  of  the  candidate  repellents. 
As  can  be  seen  from  Fig.  3,  the  models  with  four  and  more 
descriptors  are  characterized  by  very  close  statistical  parameters. 
Following  the  “Occam’s  Razor”  rule  of  simplicity,  QSAR  mod¬ 
els  with  four  descriptors  were  preferred  and  considered  further 
(Table  2  and  Fig.  4),  thus  making  the  comparison  of  the  two 
models  easier.  Using  four  descriptors,  we  obtain  superior  R2 
values  of  0.729  and  0.689  for  the  25  /amol/cm2  and  2.5  /amol/cm2 
concentration  data  sets,  respectively. 

As  can  be  seen  from  Table  2,  the  statistical  parameters 
obtained  in  both  cases  are  close,  whereas  the  descriptors  that 
appear  in  the  two  models  are  quite  different.  The  reasons  for 
such  behavior  may  be  as  follows: 

(i)  Data  distribution:  a  subtle  distinction  in  the  data  distribu¬ 
tion  could  lead  to  selection  of  a  different  descriptor  set  that  fits 
the  experimental  data  best. 

(ii)  High  intercorrelation:  at  each  consecutive  step  the  BLMR 
algorithm  selects  only  one  of  a  pair  or  a  set  of  highly  intercor- 
related  descriptors,  which  is  considered  further.  Thus,  depending 
on  the  data  set,  different  but  physically  similar  and  highly 
intercorrelated  descriptors  (Table  S5)  may  appear  in  the  differ¬ 
ent  models. 

The  descriptors  introduced  into  the  two  models  (Table  2)  may 
be  classified  as  ( i )  electrostatic  interaction-related  descriptors: 
RNCG  relative  negative  charge  (QMNEG/QTMINUS),  mini¬ 
mum  e-n  attraction  for  bond  C-O,  WPSA-2  weighted  PPSA 
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Fig.  3.  Plot  of  R2  vs.  number  of  descriptors  used. 


(PPSA2*TMSA/1000),  maximum  f-electron  reactivity  index  for 
atom  C,  and  maximum  e-e  repulsion  for  bond  C-C;  and  (ii) 
steric  interactions-related  descriptors:  YZ  shadow/YZ  rectangle, 
molecular  volume/XYZ  box,  principal  moment  of  inertia  Z. 

The  mechanism  of  repellent  action  is  known  to  be  receptor- 
based,  possibly  involving  the  GPRor7  receptor  (until  recently,  it 
was  designated  as  AgOR7)  on  mosquito  antennae  as  a  target  for 
the  bioactive  molecules  (51).  The  first  class  of  descriptors  could 
be  related  to  orientation  effects  and  to  the  possibility  of  ligand- 
receptor  noncovalent  bonds.  Most  of  the  electrostatic  descrip¬ 
tors  depict  the  distribution  of  negative  charge  within  the  mole¬ 
cule  and  are  probably  connected  to  the  heteroatoms  present, 
such  as  oxygen  and  nitrogen.  Especially  the  “Minimum  e-n 
attraction  for  bond  C-O”  descriptor  shows  how  important  the 
presence  of  a  carbonyl  group  is  for  the  repellent  effect  and 
indicates  that  the  stronger  the  -C=0  bond  the  longer  the 
protection  time. 

The  second  set  of  descriptors  encoding  the  mass  distribution 
and  the  size  of  the  molecules  probably  depict  the  steric  inter¬ 
actions  that  are  responsible  for  the  surface  recognition  between 
the  ligand  and  receptor.  The  structures  identified  on  the  basis  of 
our  models  as  highly  active  at  both  concentrations  (4k,  4j,  4i,  4o, 
4n,  4o',  4f,  41,  4j',  and  4h)  (Table  1  and  Table  S6)  probably 
include  all  the  structural  features  essential  for  repellency. 

Conclusion 

This  report  documents  significant  findings  in  the  area  of  repel¬ 
lent  research  through  the  application  of  a  combination  of 
techniques  and  methods  from  the  disciplines  of  medical  ento¬ 
mology  and  synthetic  and  theoretical  chemistry.  Models  were 
constructed  by  using  a  subset  of  30,000  chemicals  accumulated 
in  the  USDA  archives  over  the  last  60  years.  The  repellency 
assays  originally  used  to  study  these  archived  chemicals  were  the 


£  58 

a> 

E  48 

•■s 

c 

.2  38 

<u 

O  28 


25  pmol/cm2 


2.5  pmol/cm2 


Experimental  protection  time  (in  days) 


Experimental  protection  time  (in  days) 


Fig.  4.  Predicted  vs.  observed  protection  time  values  at  concentrations  25 
and  2.5  /xmol/cm2. 
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Table  2.  Best  four  descriptors  models  and  their  statistical  parameters 


Cone.,  No.  of 

/rmol/cm2  descriptors  B  St  1C  Name  of  descriptor 


25+ 

0 

-188.8 

84.08 

-2.246 

Intercept 

1 

-2686 

461.3 

-5.823 

0.09647 

Maximum  1-electron  reactivity  index  for  atom  C 

2 

-2616 

488.2 

-5.359 

0.7253 

Principal  moment  of  inertia  C 

3 

2.040 

0.6920 

2.948 

0.3632 

Maximum  e-e  repulsion  for  bond  C-C 

4 

-0.02195 

0.009215 

-2.382 

0.7759 

WPSA-2  weighted  PPSA  (PPSA2*TMSA/1000) 

2.5* 

0 

-726.1 

329.3 

-2.205 

Intercept 

1 

-68.13 

9.393 

-7.254 

0.5248 

YZ  shadow/YZ  rectangle 

2 

58.50 

13.22 

4.426 

0.7120 

Molecular  volume/XYZ  box 

3 

-71.37 

16.41 

-4.350 

0.5696 

RNCG  relative  negative  charge  (QMNEG/QTMINUS) 

4 

1.870 

0.8053 

2.321 

0.2822 

Minimum  e-n  attraction  for  bond  C-O 

B,  regression  coefficient;  S,  regression  coefficient  error;  t,  Student  criterion;  1C,  partial  intercorrelation;  PPSA,  partial  positively  charged  molecular  surface  area; 
WPSA,  weighted  PPSA;  RNCG  relative  negative  charge,  ratio  between  the  maximum  atomic  negative  charge  and  sum  of  the  negative  atomic  charges  in  the 
molecule. 

*/V  =  4;  n  =  34;  R 2  =  0.729;  R?v00  =  0.638;  R?»mo  =  0-628;  F  =  19.50;  s  =  9.769. 

*N  =  4  ;n  =  34;  R2  =  0.689;  R^voo  =  0.608;  R2„M0  =  0.582;  F  =  16.05;  s  =  1.815. 


same  as  those  that  led  to  the  discovery  of  DEET  in  1953,  the 
most  widely  used  repellent  in  the  United  States.  The  appeal  of 
DEET  as  the  “gold  standard”  is  based  on  its  excellent  human-use 
safety  record,  its  ability  to  protect  humans  from  bites  of  a  wide 
range  of  hematophagous  arthropods,  and  its  duration  of  pro¬ 
tection  on  the  skin — the  measure  of  merit  for  the  original 
repellency  assays  and  for  those  used  in  the  current  study.  While 
some  newer  commercial  repellents  are  nearly  as  efficacious  as 
DEET-based  repellents  with  respect  to  their  protection  from 
insect  bites,  to  the  best  of  our  knowledge,  none  protect  users  for 
significantly  longer  than  equivalent  stoichiometric  or  gravimetric 
amounts  of  DEET. 

In  this  study,  we  performed  a  successful  closing  of  the 
“QSAR-synthesis-bioassay”  cycle.  Using  the  original  data,  lin¬ 
ear  and  nonlinear  QSAR  approaches  were  applied  consecutively 
to  two  series  of  200  and  34  compounds,  respectively.  On  the  basis 
of  this  model,  we  identified  23  compounds  for  synthesis  and 
study  that  we  expected  to  be  as  efficacious  as  DEET.  Modifi¬ 
cation  of  the  original  repellency  assays,  including  application  of 
chemicals  in  stoichiometrically  equivalent  amounts  and  convert¬ 
ing  from  a  class  system  to  recording  of  actual  days  of  protection, 
confirmed  that  most  of  these  novel  acylpiperidines  were  equiv¬ 
alent  to  or  better  than  DEET  in  duration  of  protection.  Aston¬ 
ishingly,  a  number  of  these  protected  >3  times  as  long  as  DEET. 
Finally,  the  new  repellency  data  were  incorporated  into  struc¬ 
ture-activity  models  and  produced  exceptional  correlation  co¬ 
efficients.  Modifications  and  retraining  of  the  model  will  be 
expected  as  we  assess  the  chemical-structural  impact  of  new 
classes  of  compounds  on  mosquito  repellent  efficacy. 

Methods 

Synthesis.  See  SI  Text. 

Biological  Testing.  Laboratory-reared  Aedes  aegypti  mosquitoes  (Orlando 
strain,  1952)  were  obtained  from  the  USDA-Agricultural  Research  Service 
Gainesville  colony.  Adults  were  provided  with  sugar  and  water  and  main¬ 
tained  nulliparous  in  laboratory  cages  at  an  ambient  temperature  of  28  ±  1°C 
and  relative  humidity  of  35-60%.  Repellency  assays  were  conducted  in  45 
cm  x  37.5  cm  X  35  cm  (=59,000  cm3)  cages  containing  500  ±  50  mosquitoes 
(5-10  d  old).  Female  mosquitoes  were  preselected  from  those  that  did  not 
display  host-seeking  behavior  and  from  males  by  capture  in  a  trap  after 
anemotaxis  toward  odors  from  a  human  hand  in  a  draw  box  (52)  and  then 
were  transferred  to  a  test  cage  for  acclimatization  for  15-20  min  before 
bioassays. 

Two  concentrations  of  each  candidate  repellent  were  prepared  in  a  2-dram 
vial  with  acetone  (1  ml)  as  the  solvent.  An  appropriate  amount  of  each 


piperidine  was  added  to  the  vials  to  produce  a  deposition  of  25  /xmol  and  2.5 
jLtmol  of  each  piperidine,  respectively,  per  cm2  on  a  50  cm2-section  of  muslin 
cloth  that  was  inserted  into  the  vial.  Each  cloth  and  solution  were  kept  sealed 
in  vials  and  stored  in  a  freezer  at  -4°C  until  used  on  "day  0."  On  day  0,  each 
cloth  was  removed  from  its  vial  and  mounted  on  two  sections  of  5  cm  x  2.5  cm 
card  stock.  Masking  tape  was  applied  to  the  card  stock  edges  and  the  assembly 
was  hung  from  a  rack  and  allowed  to  dry  for  at  least  1  h. 

Two  volunteers  (one  male  and  one  female)  participated  in  this  study.  The 
protocol  was  approved  as  project  636-2005  by  the  University  of  Florida  Human 
Use  Institutional  Review  Board-01.  Informed  consent  was  obtained  for  sub¬ 
jects  before  participation  in  this  study.  Bioassays  were  conducted  by  covering 
the  volunteer's  hand  with  a  soft-embossed  long  cuff  poly  glove  (Atlantis 
Products)  and  then  covering  the  same  hand  with  a  powder-free  latex  glove 
(Diamond  Grip,  Microflex).  A  stocking  (Leggs  everyday  knee  highs)  was  then 
pulled  over  the  hand  and  arm  to  cover  the  skin  surface  of  the  arm.  A  polyvinyl 
plastic  sleeve,  with  a  Velcro  seam  to  seal  the  edges  around  the  arm,  and  with 
a  3  x  8  cm  opening  (i.e.,  window)  cut  into  the  plastic  approximately  half  way 
between  the  wrist  and  elbow  end  of  the  sleeve,  was  fastened  around  the  arm. 
Each  cloth  patch  assembly  was  affixed,  one  at  a  time,  over  the  open  window 
with  masking  tape  to  hold  it  in  place  on  the  sleeve.  This  allowed  volatile 
kairomones  to  pass  from  the  skin  surface  through  the  window  opening  and 
attract  mosquitoes  which  might,  if  the  treatment  was  not  sufficiently  repel¬ 
lent,  bite  through  the  cloth  in  the  open  window.  All  2.5  /xmol/cm2  samples 
were  assayed  before  the  25  /xmol/cm2-treated  patches;  however,  the  order 
within  each  concentration  group  was  random,  differing  not  only  for  the 
individual  volunteers  but  also  day-to-day  for  a  single  individual. 

The  arm  and  affixed  cloth  patch  were  inserted  into  the  cage  of  mosquitoes 
and  held  stationary  for  1  min  to  determine  whether  the  cloth  patch  was 
repellent.  The  number  of  feeding  mosquitoes  was  counted  before  removal  of 
the  arm  with  a  quick,  brisk  shaking  movement.  This  procedure  was  repeated 
daily  until  the  failure  threshold  was  reached.  Feeding  mosquitoes  that  re¬ 
mained  on  the  window  were  considered  to  be  biting.  A  maximum  of  10 
different  repellents  were  assayed  with  the  same  group  of  preselected  mos¬ 
quitoes.  Each  additional  set  of  10  candidate  repellents  was  run  in  a  separate 
cage  of  =500  mosquitoes.  This  procedure  minimizes  the  fatigue  and  attenu¬ 
ated  response  of  mosquitoes  subjected  to  repeated  exposures  to  repellents,  it 
and  avoids  depletion  of  sufficient  numbers  of  fresh  mosquitoes  to  conduct 
bioassays  with  all  compounds  at  both  concentrations  from  day  0  onward. 

The  failure  threshold  for  repellency  for  these  experiments  was  established 
as  1  %  biting  (5  bites)  and  confirmed  by  achievement  of  two  consecutive  days 
of  5  or  more  bites.  The  failure  point  was  ultimately  recorded  as  the  first  day, 
rather  than  the  second  day,  that  5  bites  were  achieved  through  a  repellent- 
treated  cloth. 
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